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ON THE USE OF EULER’S THEOREM ON ROTATIONS FOR 


THE SYNTHESIS OF ATTITUDE CONTROL SYSTEMS 

By George Meyer 
Ames Research Center 


SUMMARY 


The problem of controlling the rotational position (attitude) of a rigid 
body in three dimensions is discussed. Several control lavs are synthesized 
for this six-dimensional nonlinear control problem by means of some veil-known 
techniques of classical mechanics. 

The system input, output, and error are represented by 3 x 3 orthogonal 
matrices. Euler’s theorem on rotations is employed to express the error matrix 
in terms of the angle of rotation and the direction cosines of the real eigen- 
vector of the error matrix. It is noted that the angle of rotation is a con- 
venient scalar representation of the system error. A class of control lavs 
for which the control torque is a function of the real eigenvector of the error 
matrix and the angular velocity of the controlled body is synthesized. Condi- 
tions are stated for which the system governed by such control lavs is 
asymptotically stable everywhere. 

The results are illustrated with three examples: reaction jet control, 
reaction wheel control, and reaction wheel control with bounded motor torque 
and speed. 


INTRODUCTION 


The need for an attitude control system arises frequently in aerospace 
technology. The Orbiting Astronomical Observatory (see refs. 1 and 2) (0A0) 
provides a typical example. It consists of a telescope rigidly attached to an 
Earth -orbiting satellite. The function of the attitude control system of the 
satellite is to point the telescope in any direction specified by a terrestrial 
astronomer. His commands may, for example, be step changes in attitude going 
from one object in the sky to another, or continuous changes which correspond 
to scanning a portion of the sky or to following a moving object. 

The control problem for this satellite falls into two categories. First, 
while examining a particular object, the satellite is required to hold attitude 
to an extremely high degree of accuracy. This problem has been studied in sev- 
eral works, such as references 1, 2, and 3 * Second, to change from viewing one 
part of the sky to another, large slewing angles are required. This problem is 
considerably more difficult from the analytical standpoint than the first one 
because the describing equations are inherently nonlinear, and any lineariza- 
tion of these equations would be likely to produce meaningless results. Since 
nonlinear equations must be used, the problem is to find a descriptive means 
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which will allow the determination of the proper control for the slewing 
motion. Euler angles are often used for the kinematic description of motion. 
This produces equations that are not only nonlinear but involve complicated 
combinations of trigonometric terms. The determination of the control in 
terms of these variables is difficult, and no complete solution to the large 
angle slew problem exists. 

A solution to the slewing problem is proposed in the present report. The 
solution is complete in the sense that the proposed control laws yield asymp- 
totic stability for all attitudes and attitude changes, and both the kinematic 
and the dynamic nonlinearities are taken into account. The solution is based 
on the well-known fact that three-dimensional rotations may be represented by 
3X3 orthogonal matrices. Both the system input and the system output are 
represented by such matrices, and the system error is defined to be the 3x3 
orthogonal direction cosine matrix corresponding to the rotation between the 
actual and desired attitudes of the vehicle. This definition of system error 
permits the theory of three-dimensional rotations to be applied to the atti- 
tude control problem. In particular, Euler's theorem on rotation (see ref. ^-) 
is employed to define a quantitative representation of the system error as the 
rotation angle of the error matrix. This angle and the magnitude of the angu- 
lar velocity of the body are used to construct Liapunov functions by means of 
which stability of the system is investigated. A class of control laws for 
which the control torque is a function of the real eigenvector of the error 
matrix and the angular velocity of the controlled body is synthesized. For 
small errors in attitude these control laws are like those obtained in the 
Euler angle approach; in addition, they yield asymptotically stable systems 
for all attitudes. 

Several details needed in the main discussion are developed in the 
appendixes. In particular, notation and special functions together with some 
of their properties are summarized in appendix A. Some of the consequences of 
Euler’s theorem on rotation are discussed in appendix B. The dynamic equations 
corresponding to the two methods of generating torque namely by means of 
reaction jets and reaction wheels are derived in the desired form in 
appendix C and several aspects of optimal control are considered in appendix D. 
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real unit eigencolumn of R corresponding to the 
eigenvalue +1 

nonlinear (gyroscopic) part of the dynamic equation 
representation of the angular momentum E in the a basis 
representation of the angular momentum h in the s basis 
initial angular momentum of the system 

maximum angular momentum capacity of a set of reaction wheels 
the identity matrix 

representation of the moment of inertia operator in the 
a basis 

eigenvalues of J a 
maximum eigenvalue of J a 

t 

error matrix defined by A as A^ s 

angular velocity of the a basis relative to the s basis 
angular velocity of the d basis relative to the s basis 
representation of the vector w a in the a basis 

representation of the vector w^ in the d basis 

representation of the control torque in the a basis 

spherical limit on the control torque 

measure of mass asymmetry for the reaction jet control 

measure of mass asymmetry for the reaction wheel control 

inverse of the moment of inertia of a spherically symmetric 
mass 

error angle 

point of saturation in the function sat(cp,cp ) 
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Special Symbols 

y time derivative of the column matrix y 

Y time derivative of the matrix Y 

llyll (y-ty) 172 

yfc transpose of the column y 

transpose of the matrix Y 

Y 1 inverse of the matrix Y 

tr(Y) trace of the matrix Y (see eq. (Al)) 

q(y) column function of the matrix Y (see eq. (A2)) 

S(y) matrix function of the column y (see eq. (A3)) 

sat(cp,cp g ) saturation function = cp/cp s for 0 < cp < cp s and. 1 for cp > cp s 


argmax 

xeX 

f(x) 

that 

value 

of X 

in 

X 

which maximizes 

f(x) 

argmin 

f(x) 

that 

value 

of X 

in 

X 

which minimizes 

f(x) 


xeX 


gradient with respect to (x 1; x 2 ,x 3 ) 


ANALYSIS OF VEHICLE MOTION 


The attitude of a rigid body relative to external space can be completely 
specified by locating a Cartesian set of coordinates fixed in the rigid body 
(see ref. 4). The orientation of the body set of coordinates relative to any 
other set with common origin may be described by a matrix of direction cosines. 
This matrix will be defined as the output of an attitude control system. 

Consider two right-hand orthonormal triplets of vectors with the common 
origin 0: s = (u sl ,u S2 ,u S3 ) and a = (u ai ,u a2 ^%3^ respectively. Let the 

triplet a be fixed in the body., and let the triplet s be fixed in inertial 
space. Then, the attitude of the body relative to s will be defined by a 
transformation which maps s into a. Let such a transformation be denoted by 
A as , so that 


u ai ~ Aas u si > i - 1*2,3 (l) 

Let A as be represented with respect to the s basis by the 3X3 matrix A as . 
Since A as is a matrix of direction cosines (i.e., aqj = u a i • u s j) it is 
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orthogonal . Thus , 

A as A as 1 

where A as is the transpose of A as and I is the identity matrix. Equa- 
tion (2) indicates that the output of an attitude control system may he 
represented by an orthogonal matrix; this fact is of primary importance in the 
present note. The properties of three-dimensional rotation matrices , listed 
in appendix B, are used in the following development. 


Suppose that the triplet a is rotating relative to the triplet s. Then 
A as is a function of time, say A as (t). Hence, according to equation (2) 
Aas(t)A as (t) = I for all t, and 


• . -f* . -h 

A A^ + A A -A A U + 

H as^as ^ ^as^as ^as^as ^ 


(A. 


,aL) 1 = 0 


as Xi as 


* t 

where the dot indicates time differentiation. Consequently, A as A as must 
always be a skew- symmetric matrix, say S. From the definition of angular 
velocity it follows (see appendix B) that if the column matrix w a represents 
(with respect to the a triplet) the angular velocity vector w a of the a 
triplet relative to the s triplet, then (see ref. 5) 


S = s(w a ) 




( 3 ) 


Thus , the rotation matrix A as and the column matrix w a which define the 
attitude and the angular velocity of the controlled rigid body, respectively, 
are connected by the matrix differential equation 


*as = S(v a )A as , A as (0)A* s (0) - I (4) 

Equation ( 4 ) will be referred to as the matrix form of the kinematic equation. 
It applies to all attitude control problems. 

Unlike the kinematic equation, the dynamic equation depends on the partic- 
ular method of generating control torque. In the present report, two methods 
will be discussed. The first is one in which the control torque is external 
(i.e., control by means of a set of reaction jets). The second is one in 
which the total angular momentum of the system is conserved, and the control 
torque is generated internally by a momentum exchange device (i.e., a set of 
reaction wheels). 


The dynamic equations corresponding to these two schemes are derived in 
appendix C. Both equations are of the following form: 
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w a 2 a + ^^as ;¥ a^ (5) 

where J a is a constant matrix, measuring moment of inertia; z a is the column 
matrix representing the control torque vector with respect to the body axes 
(the a triplet) and g is a column matrix (representing the gyroscopic 
acceleration) which is a nonlinear function of the attitude and angular 
velocity of the controlled rigid body. 

When the control torque is external, 

g = Ja 1 S(w a )J a w a (6a) 

where J a is the matrix which represents with respect to the body axes the 
moment of inertia operator of the rigid body being controlled. 

When the control is internal, through a set of reaction wheels, and the 
total system angular momentum is conserved, 

g = j; 1 S(v a )A as h s (0) (6b) 

where Ja represents, relative to the body axes, the inactive moment of 
inertia (i.e., the inertia of the rigid body plus locked wheels minus the 
inertias of the wheels about their spin axes). The column matrix h s (0) 
represents, with respect to inertial space, the total angular momentum of the 
system. It is constant when external torques are absent. 

The kinematic equation together with the appropriate dynamic equation 
describes the controlled object from the point at which the control torque is 
generated to the output attitude. In order to employ feedback control, the 
attitude A as of the controlled body must be known. The function of an 
attitude sensor is to measure A as . 

It will be assumed henceforth that the independent controlling variable 
is the control torque z a and that A as is measurable directly; that is, 
any dynamic elements between the applied voltage and the control torque, and 
between the sensor variables and the corresponding output voltage will be 
neglected in the sequel. The equations of the plant are then the following: 

■^■as “ ^^ ¥ a^as (7a) 


= J a lz a + g( A as' w a) (7b) 

_ Consider, next, the way in which the system error may be defined. Let 
Ads ( t ) be a transformation given as a function of time which defines the 
desired attitude of the controlled body (i.e., the input) relative to the 
s triple (inertial space). The matrix A& s (t) will represent A& s (t) with 
respect to s. The attitude control problem consists in specifying a control 
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law which will force the attitude A as of the controlled body to approximate 
A ds (t) in some sense as nearly as possible consistent with the existing 
constraints* 

When Aas 4 Ads ( t ) an error exists, and the question arises as to how to 
specify it. An obvious choice is to define the error by the matrix 
E = A< 3 _ s - A as with the desired condition given as E = 0. This choice will 
not be made because the significant property of orthogonality would be lost: 

E is not orthogonal. A more advantageous choice for a definition of the error 
is to define it by the orthogonal matrix 

R = A as A ds( t ) (8) 

with the desired condition given by R = I. The matrices A^ s , A as , and R 
will be referred to as the input, output, and error matrices of the system, 
respectively. 

The preceding discussion is summarized in the form of the block diagram 
of figure 1. 



Figure 1.- The block diagram of an attitude control system. 


THE SYNTHESIS OF CONTROL LAWS 


Let the error matrix R be defined by equation (8). According to Euler's 
theorem on rotations (see ref* 4), the matrix R may be considered at every 
instant of time to represent a single rotation. The angle cp of this rotation 
is given by the following equation (see appendix B) . 


9 


cos -1 


jl [tr(R) 



0 < cp < it 


(9) 
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Clearly, 9=0 if and only if R = I; otherwise, cp > 0. It is, there- 
fore, natural to consider cp as a quantitative representation of the attitude 
error. Indeed, a sufficient condition for driving R to I is to maintain the 
time rate of change of 9 negative. 

The time derivative of equation ( 8 ) is 

R = S(w a )R - RS(w d ) (10) 

• -j- 

where S(w d ) depends only on the input. Namely, S(w d ) = A ds A ds* According to 
the property (All) of appendix A, RS(w^) = S(Rw d )R; hence, 

R - S(w a - Rw d )R (11) 

Equation (ll) will be referred to as the kinematic equation of the system 
error. The column matrix w a - Rw^ represents the error velocity vector 
w = w a - Wd with respect to the body axes. 

Let c denote the real unit eigenvector of R. The existence of c is 
guaranteed by Euler's theorem. According to equation (B7a) of appendix B, 
which is true for any rotation matrix, 

9 = ct ( w a ' ^d) 


_ , t_ t 

But c R = c ; hence, 

9 = c t (w a - W d ) ( 12 ) 

Equation (12) implies that any control which maintains the projection of 
w a on c less than the projection of w d on c throughout the control 
interval will force the vehicle into the desired attitude. 


Kinematic Control 


Generally speaking, when the vehicle is controlled by means of an angular 
momentum exchange device (i.e., reaction wheels or control moment gyros) the 
angular momentum capacity of the device is small, while the torque levels of 
the motors, which cause the exchange, are high. Consequently, when cp is far 
from 0 , the time required to perform the exchange is sufficiently short, rela- 
tive to changes in c, to be practically instantaneous. That is, when cp is 
far from 0 , it may be possible to approximate the attitude control system by 
a purely kinematic model. In that case the independent controlling variable 
becomes the angular velocity of the controlled body, and a possible control law 
is the following: 

t 

w a = argmin w a c 

w eW 
a a 

where W a is the set of vehicle angular velocities which are consistent with 
the capacity of the momentum exchange device. 


(13) 
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Small Error Control 


In the neighborhood of the point 9 = 0, the kinematic model becomes 
inadequate because of integration lag. However, in this case the behavior of 
the system may be investigated by means of the following equation, which is 
equivalent to equation (10) to first-order terms in 9 (see appendix B) 

q(R) = S(w d )q(R) - w d + w a (l4) 

where q(R) is a scaled real eigenvector of R, namely, q(R) = sin 9 c. It is 
given explicitly as a continuous function of the elements of the matrix R in 
appendix A by equation (A2). 

For example, suppose that the vehicle is controlled by means of a set of 
reaction wheels, that the total angular momentum of the system is zero, 
that the reference attitude is constant. Then equation ( 14) together with the 
dynamic equations ( 7 b) and ( 6 ) with h g (o) = 0 implies that 

q(R) - = 0 (15) 

In particular, let the control law be the following, where A and B are 
constant matrices. 


z a = -Aq(R) - b4(R) (16) 

Then the behavior of the system near R = I is specified by the following 
differential equation 

q(R) + J a ' L Bq(R) + J^AqfR) =0 (17) 

It may “be noted that the above linear differential equation with constant 
coefficients is independent of the nominal attitude defined by A^ s . On the 
other hand, in the conventional approach (see ref. 3) based on Euler angles 
the coefficients of the perturbation equation are functions of the Euler angles 
of Ads* This means that stability (for example) must be investigated for 
every expected nominal attitude. Such is not the case if equation (17) is used. 

Restricted Dynamic Control 

Next, suppose that neither a kinematic model nor a perturbation model 
describes the system adequately. Suppose that the vehicle is described by 
equation ( 7 ) > and that it is desired to rotate it from one attitude to another 
attitude with zero initial and final velocities. A possible scheme for achiev- 
ing this maneuver is to rotate the vehicle about the eigenvector of the error 
matrix R until the desired attitude is reached. The equations of motion 
corresponding to such a control scheme may be derived as follows. 

According to (12) when w^ = 0, 


<P 


c w. 


a 


(18) 
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Since the control rotates the vehicle about the eigenvector c of R, the 
acceleration w a as well as the velocity w a must be parallel to c. That 
is, the acceleration must be of the following form where f is a scalar. 

w a = -fc (19) 

Consequently, the error angle cp satisfies the following differential equation: 

cp + f = 0 (20) 

If f depends on the state (R,w a ) in such a way that it may be expressed as a 
function of cp and its derivatives only, then (20) is the equation of motion 
of the system. Thus, if the dynamic equation of the system is given by (7b) 
and if the control law 


z a = -f(cp,<p)J a c- J a g(RA ds ,w a ) 


( 21 ) 


is selected, then the following equation of motion of the system results* 


9 + f(<P/P) = 0 


( 22 ) 


It should be noted that the control just described is possible only when 
the initial angular velocity of the vehicle is either zero or an eigenvector 
of the initial error matrix. If there is velocity orthogonal to the eigen- 
vector, then the direction of the eigenvector cannot be kept constant with 
finite accelerations* 


A case for which B(0)w a (0) f w a (0) may be treated as follows, (it is 
still assumed that the input is an attitude step.) Select a control law which 
applies torque so that the vehicle decelerates along w a (0) until time ti 
when w a = 0; then rotate the vehicle about the real eigenvector of the result- 
ing R(ti) as discussed above* Such a control is simple to implement and may 
be attractive in practice if the behavior of the resulting control system does 
not differ much from that of a specified optimal system* For example, suppose 
that a system having the dynamic equation, 


w rt = az,. 


(23) 


where a is a constant scalar (i.e., eq. (5) with J a = i/a and g = 0) and 
||z a || < z is subjected to the following control law: 


z 


a 


p 

-z max w a (0)/ ||w a (0)|| until w a = 0, followed by the 
time-optimal control about the real eigenvector of 
the resulting R. 


(24) 


The behavior of this control law with respect to time-optimality was judged 
by means of a digital computer simulation. The procedure was to drive the 
system from the point (R = I, w a = 0) to an arbitrary point [R(t 0 pt) > ^a(^opt)] 
by means of the time-optimal control (see appendix D) , and then apply the 
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sub optimal control ( 2 k) to bring the resulting point to [R(t Q p t + tg^) = I, 
v a (t op t + t su p) = 0] , and compare the two times t 0 p^ and tg^p. The results 
are summarized in sketch (a). The data indicate that if a departure in the 
response time by 65 percent is permitted, the simple suboptimal control is 
adequate. 



Sketch (a).- The shaded region contains 1500 points corresponding to 
1500 different initial conditions. 


Dynamic Regulator - Asymptotic Stability 

The last result to be included in the present report deals with the con- 
ditions for which attitude control systems, governed by a class of control 
laws, are asymptotically stable* It is assumed that the reference attitude 
A^_ s is constant for t > 0, and that the independent controlling variable is 
the torque z a * The pertinent set of differential equations is the following 
(see eqs. (7b) and (ll) with w & “ 0) • 

R = S(w a )R (25a) 

w a ~ z a §(^\lS' ,w a) (25b) 

Let the state space (see ref* 6) for the set (25) be chosen to be 

X = [ (R,w a ) such that RR^ -1 = 0] 

Suppose that a control law z a = f(R,w a ) is defined on X* The trajectory of 
(25) subjected to this law will be denoted by x(t;x 0 ), where 
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x(0;x o ) = x Q = [R(0 ) , w a (0)] 


A region P defined on X and containing the point (1,0) will be called a 
region of asymptotic stability of (25) subjected to the given control law if 
for every x Q in P, x(t;x Q ) is in P for all t > 0 and x(t;x Q ) -> (l, 0 ) as 

t -*■ oo. 


Let the following two scalar functions be defined on X. 

cp = q>(R) = cos -1 -jjj [tr(R) - l]j- (26a) 

v = w (w a ) = (v^v a ) l/2 (26b) 

where cp is the angle of R defined by equation ( 19 ), and w is the norm of 
w a . The image of X under ( 26 ) will be denoted by Y. Clearly, 

Y = [(cp,w) such that 0 <cp<JT , 0 < w] 

The image under (26) of a trajectory x(t;x Q ) will be denoted by y(t;y Q ) 
where 


y(0;y o ) = y Q = {cp[R(0)], w[w a (0)]} 

In general, through a given point in Y there will pass more than one trajec- 
tory. Hence, in general, it is not possible to consider the trajectories 
y(t;y Q ) to be solutions to a differential equation of the form y = F(y), and 
to construct a region of asymptotic stability on Y from the properties of 
F(y). However, in some cases it is possible to construct a region of asympto- 
tic stability on Y directly from the properties of the trajectories y(t;y )• 
This is done next. 

Let Q(m,n) be a subset of Y defined by 

Q(m,n) = [(cp,w) such that 0<cp<m<at , 0<w<n] 

and let the scalar function V(cp,w) (tentative Liapunov function) be defined 
on Q(m,n) as follows: 

n Cp n W 

V(cp,w) = g 1 (s)ds + [s/g 2 (s)]ds (27) 

where the scalar functions g x (s) and g 2 (s) are such that for m,n > 0 and 
M,N < oo. 


0 < g 1 (s) < M for 
0 < g 2 (s) < N 


0 < s < m, g^O) < M 
for 0 < s < n 
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(28a) 

(28b) 



Then v(0,0) = 0, and v(cp,w) > 0 for other points in Q(m,n) • Moreover, the 
contours V(cp,w) =■ vi and V(cp,w) = v 2 have no points in common if vi / v 2 * 
Finally , V(cp 2 ^0) > V(cpi^O) if, and only if, cp 2 > cpxj similarly, 

V(0,v 2 ) > V(0,vi) if, and only if v 2 > ¥ 1 . Therefore, the sectors in Q 
defined hy 

u(v) = -I (cp>w) such that V(cp,w) < v < v* = min [v(m,0), V(0,n)]l (29) 

L and cp>0,w>0 J 


are nested by v, namely, 

u( V2 ) CZ U(vi) if 0 < V2 < vj < v* 


(See sketch (b)). 



<f 

Sketch (b) 


Hence, if there exists a number b < v* such that for any trajectory y(t;y Q ) , 
with y Q in U(b), the time derivative of v, along that trajectory, is 
negative semidefinite, namely. 


v < 0 

and if v = 0 for at most a countable number of points along the trajectory, 
then (see ref* 7) the region U(b) is a region of asymptotic stability on Y; 
while, the region P(b) of points (R,w a ) in X which maps under ( 26 ) into 
U(b) is a region of asymptotic stability of (25) subjected to the given control 
law* 
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Suppose that the control law has the following form: 

z a = -J a ts 1 ( l ?)g 2 (w)c + g 3 (cp,v)w a ] 


(30) 


where g^cp) and g 2 (w) satisfy conditions (28), c is the eigenvector of R, 
and g 3 (cp,w) satisfies the relation 

^83(9^) > w a g(RA ds ,v a ) (31) 

on a sector U(b) where h > 0; then 

P(b) = [(E,w a ) such that (cp,w) is in U(b)] 


is a region of asymptotic stability of (25) subjected to the control law (30). 


Indeed, 

• - 1 

dcp 

which on using (25b) and (27), the fact that cp = w a c (see eq. (12) with 
w d = 0) and the control law (30), becomes 

v = gg 1 [v^g(RA ds ,w a ) - w^gjcp^)] 


V(9,w)cp + ^7- V(cp,w)w 
ow 


Hence, if condition (31) holds. 


v < 0 


Moreover, from (30), (25b), and (6) (which imply g = 0 if v = 0) it is clear 
that w a ^ 0 for w = 0 and cp / 0. Hence, v = 0 for, at most, a countable 
number of points along the trajectory. Consequently, P(b) is a region of 
asymptotic stability of (25) subjected to the control law (30) with condition 
(31). 


Two special cases of (25) will be used to illustrate the application of 
the above-derived result. The first case is one in which g(RA ds ,w a ) in 
equation (25b) is defined by (6a) and corresponds to control by means of a set 
of reaction jets. The second case is one in which g(RA ds ,w a ) is defined by 
(6b) and corresponds to control by means of a set of reaction wheels. 


Case I : Let the nonlinear part g of the dynamic equation (25b) be 

defined by (6a), and consider the special case of control law (30) with 
g 1 (cp) = kicp,g 2 (w) = 1, g 3 (9,w) = k 2 , where k x and k 2 are constant positive 
scalars. Then (30) becomes essentially a proportional plus rate control: 

z a = -J a (kicpc + k 2 w a ) (32) 
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Clearly, conditions ( 28 ) are satisfied everywhere on Y. According to 
equation ( 27 ), 

V(cp,w) = ~ kicp 2 + - w 2 


Moreover, as is shown in appendix C, where the scalar [i± is defined ( G3 ) ^ 

w^g(EA ds ,w a ) = vfe£'S(vr g )J e y & = ^iw^w^w^ < 3" s/2 biX 3 

Therefore, condition ( 31 ) is satisfied on U(b) where b = (27/2)(k 2 /|_ii) 2 . 
That is, the system is asymptotically stable on the following region of X. 


1 kj. -s cos 1 — 

2 l |_2 


Ilf 


tr(R) -i|K + ±^w a < 


2JJ 


27 


M-i 


It may he noted that when the vehicle has a spherical mass distribution, 
which implies that [± 2 . = 0, the system is asymptotically stable everywhere on 

X. 


Case II : Next suppose that control (32) is applied to a reaction wheel 

control system. The nonlinear part of the dynamic equation is defined by (6b). 
It is shown in appendix C, where the constant scalar [i 2 is defined (eq. C7), 
that 


v£g(RAds,v a ) = wJj; 1 S(w a )RA ds h s (0) < ^ 2 w 2 [h^( 0)h s (0 ) ] 1/2 
Hence, the system is asymptotically stable everywhere on X if 

k 2 > |i 2 j|h s (0) || 

Thus, in both cases the simple "proportional plus rate” control law (32) 
may be used for the synthesis of asymptotically stable attitude control 
systems. 


Usually, a control system must not only be stable, but also be reasonably 
stiff to disturbances and must respond quickly to commands; it must not require 
excessive torques, velocities, power, etc. The freedom which remains in the 
selection of the functions g-j_ In (30) after conditions (28) and (3l) are 
satisfied may be used by a designer to consider such additional qualities of 
the system. 


A contro l law for the 0A0 .- Consider, for example, the 0A0. It is 
controlled by means of a set of reaction wheels, but unlike case II discussed 
above, both the control torque z a and the wheel speeds must not exceed some 
preassigned values, and a fast responding system is desired. A control law 
which may be adequate in such a case is given below: 


J ma.v ^a (*■*- k) sat (cp,cp s ) c + (l + ^) w a/ v max | 


(33) 
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where j is the maximum eigenvalue of the inertia matrix J a and 

max '“' ij 

k = q 2 j it (0)|l w /z <1 
^ u max 11 s' 11 max' max 

cp s = 2(l - k)(l + k) ( jmax v max/ z max^ 
sat(cp,cp s ) = cp” ^"cp for 0 < cp < cp s 

= 1 for cp > cp s 

Control law (33) is a special case of the control law (30). Thus, (30) 
becomes (33) if the following identifications are made: 

- sat(cp;9g) > S 2 = ^ ~ ^) z max/ ^max 9 S 3 “ l/2 (l + k) z max /( Jxnax^max^ 

Equations (33); (25h), and (6b) imply that w < 0 for all R and 
w > w max * Therefore, the trajectory of the system cannot leave the region 

T = [(R,w a ) such that ( w a w a) 1 ^ 2 5 v max^ 

Equation (33) implies that ( z | z a ^ 1 ^ 2 — z max on 

Finally, according to the discussion of case II, the system is asymptoti- 
cally stable everywhere on X if 

l/2(l + k) z max / ( Jfl^x^max) ^ l|kg(0)|| 

which is true in the present case. Therefore, the system has the following 
properties : 

i) It is asymptotically stable on T. 

ii) The torque is bounded by z max on T. 

lii) If Winax = [1]^ - ||h s (0)||]/j max where h^ is a spherical 
bound on the momentum capacity of the reaction wheels , then the wheels 
will not saturate on T. 

iv) In the neighborhood of point (1,0) the system behaves as a three- 
dimensional second-order linear system with damping of 0*5 and natural 
f r equency of 0*5(1+ k ) ZjQgx. ( ^max^ max ^ * 

v) The plots in figures 2, 3; and 4 depict some of the results 
obtained from a digital computer simulation of an OAO-type vehicle* The 
inertia matrix J a was assumed to be diagonal, J a = (999;1110,l4l0)kg-m 2 ; 
the saturation limits for the momentum exchange and control torque were 
assumed to be h^^. = 4.68 N-m-sec and z max = 0*231 N-m, respectively. 
(See ref. 2. ) 
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Figure 2.- The response of the system 
in terms of the magnitudes of the 
control torque., angular velocity, 
and error angle. 
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Figure 3*- The response of the system 
in terms of the components of the 
control torque. 
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Figure 4 .- The response of the system in terms of the components of the 
relative momentum of the reaction wheels. 

At t = 0 cp = 2 rad, c^ - -3~ x/f 2 (±,l,l) j ^a = 0.5(l,-l,-l) mrad/sec. The 
wheels were assumed to he locked prior to t =0; 'w max = 2.6 mrad/sec. 

The control may he divided roughly into three parts. For 0 < t < 100 
sec ; the control generates a pulse-like torque to bring the vehicle to its 
maximum velocity. For 100 < t < 750 sec the vehicle coasts with maximum veloc- 
ity; the small torques probably counteract the gyroscopic effects. For 
t > 750 sec ; the control again generates pulse-like torques to stop the vehicle 
on target. After the transient, the initial momentum of the vehicle resides 
in the sheels (fig. 4 ). One may suppose that although the duration of these 
intervals may change with initial conditions, the general shape of the response 
does not. 
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CONCLUDING REMARKS 


The representation of input and output of an attitude control system by 
3X3 orthogonal matrices led to the definition of the system error as the 
3x3 orthogonal matrix (the error matrix) corresponding to the rotation (the 
error rotation) between the actual attitude of the controlled body and the 
desired attitude. Euler’s theorem on rotations was employed to express the 
error matrix in terms of four parameters, namely, the angle (the error angle) 
and the direction cosines of the real eigenvector of the error matrix. It was 
discovered that the error angle and the magnitude of the angular velocity of 
the controlled body are convenient variables in the construction of Liapunov 
functions for the process. A class of control laws was synthesized for which 
the control torque is a function of the real eigenvector of the error matrix 
and the angular velocity of the controlled body. Conditions for which any 
member of the class yields an asymptotically stable control system were stated. 

The results presented are not applicable to on-off control. If they are 
to be applied to a reaction jet control system, throttling or pulse width 
modulation schemes must be employed. The results are most applicable to 
reaction wheel control systems. It appears that they may be made to apply to 
other momentum exchange control schemes (i.e., control moment gyros). 

The errors and delays in the attitude sensors, computer, and motors were 
neglected in the analysis. To estimate the significance of such effects 
physical sensors and motors were placed on an air bearing platform, and the 
loop was closed with a real time digital computer. The simulation of the 0A0- 
type vehicle was repeated and no significant departures from theory (simple 
computer simulation) were observed. 


Ames Research Center 

National Aeronautics and Space Administration 
Moffett Field, Calif., July 1, 1966 
125 - 19 - 03-09 
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APPENDIX A 


NOTATION AND SPECIAL FUNCTIONS 


In this report vectors are denoted by lower-case letters with an overbar. 
Linear transformations are denoted by upper-case letters with an overbar. The 
term triplet (or basis) always means a right-hand triplet of orthonormal vec- 
tors. Several triplets are employed; a letter is associated with each basis to 
distinguish one from another. All triplets are erected at the fixed point of 
the controlled body. A basis vector is denoted by the letter ”u" with an over- 
bar and two subscripts; the first subscript indicates the basis to which the 
vector belongs, and the second subscript indicates the place the vector occupi 
occupies in the triplet. For example, the symbol u a2 stands for the second 
vector of the a triplet (or a basis). 


Column matrices representing a vector with respect to the various bases 
are denoted by the lower-case letter used to denote the vector, but without 
the overbar and with a subscript which indicates the basis. For example, y a 
stands for the column matrix representing the vector y with respect to the 
a basis. All columns are 3 by 1. 

A linear transformation which physically corresponds to a change of bases 
is referred to as a rotation, and a linear transformation which physically 
corresponds to a linear operation on a vector is referred to as a linear 
operator. A rotation is denoted by a capital letter with an overbar and two 
subscripts. The first subscript indicates the image of the basis denoted by 
the second subscript. For example, A as is a rotation such that u ai = A as^si> 
i = 1, 2, 3* The matrix representing a rotation such as A as will be denoted 

ty A as • 


A linear operator is denoted without any subscripts but with an overbar. 
For example, in the expression h = J w, J is the linear operator which takes 
the vector w into the vector h. The matrix representing a linear operator 
with respect to a basis is denoted by the letter denoting the operator, but 
unbarred and with a subscript which indicates the basis in which the repre- 
sentation Is being made. For example, J a is the matrix representing the 
operator J with respect to the a basis. 

The transpose and inverse of a matrix A is denoted by A^ and A -1 , 
respectively. The identity matrix is denoted by I. 

Let A ~ ( a ij) be an arbitrary 3x3 matrix, and let y = (y^) be an 
arbitrary 3X1 column matrix. Then, the functions of the matrix elements are: 

The trace of A, 

tr(A) = 


3 

^ a ii (Al) 

L = 1 
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The column function of A, 



CO 

CM 

- a 32 

S 31 

" a 13 

a !2 

-a 21 


(A2) 


The matrix function of y 


0 y 3 -y^ 

s(y) = I -y 3 o y ± 


^ -n 0 


(A3) 


Let a and. b be two arbitrary vectors. The two binary operations a . b 
and c = a X b have the following meaning with respect to an arbitrary (but 
orthonormal) basis , say the e basis: 


a . b = a e b e 

c = a x b c e = -S(a e )b e 


(A4) 

(A5) 


The following identities are used in the report. (For arbitrary columns 
matrices y and z and matrix A) 


S(y)z = -S(z)y 

(A6) 

q.[S(y)] = y 

(A7) 

tr[S(y)A] = -2y t q(A) 

(A8) 

Let B be an orthogonal matrix; then 


tr(BAB t ) = tr( A) 

(A9) 

q(BAB t ) = Bq( A) 

(A10) 

S(By) = BS(y)B t 

(All) 

Let c be a column such that c^c ~ 1; then 


S 2 (c) = -I + cc^ 

(A12) 
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APPENDIX B 

SOME PROPERTIES OF ROTATIONS 
THE MATRIX FORM OF THE KINEMATIC EQUATION 


Let the a basis be rotating with angular velocity w relative to the 
s basis, and let the matrix A as represent the rotation between the s basis 
and the a basis. Let P be an arbitrary point fixed in the a basis. The 
position of P will be denoted by the vector r; its velocity relative to the 
s basis will be denoted by the vector v. Then, r a = A as r s , v a = A as v s , 
r g = v s , r a = 0; and v a = -S(w a )r & . Consequently, the following chain of 
equations is true. 


® *a ^as r s + ^as^s 


^as^as r a + ^as v s 


^as A as r a + v a 


- r a A^ 

LA asas 


-s( 


w a)l r e 


Since the chain must be true for every point fixed to the a basis, it 
follows that the matrix form of the kinematic equation is 

A a s ~ ®^ w a^as (Bl) 


THE (cp,c) PARAMETERS 


Consider any rotation matrix A as . According to Euler's theorem on rota- 
tions, A^ s always has the eigenvalue +1. Hence, the transformation 
x -> A ag x may be thought of as a rigid rotation about the direction c 
(c t c = l) of the eigenvector of A as corresponding to the eigenvalue +1, 
through some angle <p. Therefore, the matrix A as may be considered to be 
the solution at r = cp of the following differential equation. 

^Lt ^as ) = S(c)A as 

with initial condition, A as = I at t = 0; that is, 

A as = exp[cpS(c)] = I + sin cpS(c) + (l - cos cp)S 2 (c) (B2) 

Property (A12) was used in the representation of the exponential. Thus, equa- 
tion (B2) defines the rotation matrix A as in terms of the four parameters 
(cp,c). 

Conversely, the (cp,c) parameters of a rotation matrix Aas m ay be 
determined from the elements of the matrix A as as follows. (See eq. (A2) 
for the definition of the function q(A) . ) 


22 


cp = cos -1 H [tr(R) - l]j- , 


0 < cp < it 


(B3a) 

c = esc cp q(A as ) 0 < cp < it (B3h) 

The singular cases must he considered separately. When cp = it the com- 
ponents of c are the solutions of the following set of equations. 

I c il = [| ( a ii + x ) 

while., for cp = 0, c is arbitrary. 

The four parameters (cp,c) are constrained by c^c = 1. 


11 / 2 


c i c j 2 a ij 


THE q PARAMETER 


When the rotation angle of A as is known to be restricted to the interval* 
0 < cp( l/2) jt , then the matrix A as may be defined by its skew- symmetric part 
only y as follows. Let (see definition (A2)) q = l(A as ). Then by equation (B3b) 


q = sin cp c 9 


4* .0 

q°q = sm cp 


Equations (B2) and (b 4) -imply that for 0 < cp < (l/2)it 


^as 


= I + S(q) + 


1 + (1 - q t q) 




i-l 


S 2 (q) 


(b4) 


(B5) 


The components of q constitute a set of independent coordinates of A as for 
0 < cp < (l/2)jt. Note that l(A as ) is a continuous function of the elements 
of the matrix A as for 0 < cp < (l/2)jt. 


THE KINEMATIC EQUATION OF THE (cp,c) PARAMETERS 

Let the matrix A as be defined in terms of the (cp,c) parameters as in 
equation (B2) . Since c is an eigenvector of A as ., 

A as c = c 

and SO; 

A as c + A as £ = 6 
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That is. 


S(w a )c = (I - A as )c = - sin cp S(c)c - (l - cos cp)S 2 (c)c 

But, according to (A12) S 2 (c) = - I + cc^,. while c^c = 0; hence, 

S(w a )c = - sin cp S(e)c + (l - cos cp)c 


and 


S(c)S(w a )c = (l - cos cp)s(c)c + sin cp c 

But, for any three columns x, y, and z, S(x)S(y)z = (x 1: z)y - (x^y)z. (This is 
the matrix form of the vector triple product identity.) Hence, 


Wa = cpc + sin cpc + (l - cos cp)s(c)c (B6) 

while the kinematic equation is 

<p = w a c = c^Wg. (B7a) 

c = \ S(w a )c + c °t(|)[w a - (w a c)c] (B7b) 

THE KINEMATIC EQUATION OF THE q PARAMETER 


If A as is defined in terms of its skew- symmetric part as in equa- 
tion (B3) , then for 0 < cp < it, 

w a = q + fi(l,4) , 4 = w„ + f 2 (q,w a ) 


(B8) 


where , 


fi(q.>4) = 


1 - q 2 + (1 - q 2 ) 


1/2 


(ii)* * 


1 + (1 - CL 2 ) 


l/2 


"I 


S(q)q 


and 


f 2 (q,V a ) - ~ S(w a )q - | 1 - (1 - q 2 ) 


, 1/2 


w a - 2 


1 + (1 - q 2 ) 


1/2 


-l 


(w a q)q 


It may be noted that both fi and f 2 are of order higher than one in ||q|| 
for ||q|| = 0, so that the linear part of (B8) is 


w a = 1 


q = w, 


a 


(B9) 
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APPENDIX C 


DERIVATION OF THE DYNAMIC EQUATION 


The equations relating the angular acceleration of the controlled rigid 
body to the controlling torque will be derived in this appendix. Two cases 
will be considered. In the first the controlling torque is external (i.e., 
control by means of a set of reaction jets). In the second the total angular 
momentum of the system is conserved, and the controlling torque is generated 
internally by a momentum exchange device (i.e., a set of reaction wheels). 


CONTROL BY MEANS OF EXTERNAL TORQUE 


Let the two vectors h and w be the angular momentum and angular velocity, 
respectively, of the vehicle with respect to inertial space. Let J be the 
moment of inertia operator of the vehicle. By definition, 


h = J w 


The above equation has the following representation in the s basis (inertial) 
space) and the a basis (the vehicle), respectively. 


hs ~ J s’W’s 


The matrix J a representing the moment of inertia operator of the vehicle in 
the body coordinates will be constant if, as will be assumed henceforth, the 
mass distribution of the vehicle is fixed. 

If the torque acting on the vehicle is denoted by z', it follows from 
Newton’s law that 


But ha = Aas^si hence. 


h s = z s 


^a™a ^as^s + ^as^s 


The above equation together with the kinematic equation (C2) implies the 
following dynamic equation (Euler’s equations of motion). 


w a = + J a 1S ( w a )j a W a 


(Cl) 
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The following inequality is useful for estimating the effectiveness of the 
nonlinear part of (Cl) 


dt H Wa l 


J z*. = 0 


< 3 3 ^ 2 Pi ||v a |r 


(C2) 


where 

Pi = jl 1 (j2 “ J 3 ) + J2 1 (j3 “ jl) + 33 (jl “ 32 ) 
and are the eigenvalues of J a - 

The inequality (C2) follows from the fact that (d/dt)||w a || = w a w a /||w|| ^ 


(C3) 


* w i*— 1 

and that the maximum of |w a J a S(w a )J a w a = Pi w al w £L2 w a3 on the sphere 


w a w a = H w all" is 3~ 2/3 Px ||w a | 


CONTROL BY MEANS OF INTERNAL TORQUES 


Consider three wheels oriented so that the spin axis of the ith wheel 
is parallel to the ith vector of the a basis (the vehicle). Let the 
moment of inertia of the ith wheel about its spin axis be denoted by jY* 

Let the matrix represent the moment of inertia of the vehicle plus locked 

wheels with respect to the body axes, and let J a be a diagonal matrix whose 
elements are the moments of inertia jY the wheels about their spin axes. 
Then the total angular momentum h of the system may be represented in the 
a basis as follows: 


, _v , T w w 

Ct Ct CL Qi CL 


where the column matrix w a has the elements w a i which are the spin veloci- 
ties of the wheels relative to the vehicle. But h a ~ A as h s . Hence, 


T v , T w w _ A , 
^a v a + ^a^a _ ^as^s 


(C4) 


Taking the time -derivative of equation (c4). 


_v. t T w.w A , , A : 

^a v a + ^a v a “ ^as^s + ^as^s 


Let the motor torque acting on the ith wheel be denoted by -z a ^. Then, 

J I(^a + * a ) = " z a 


where the column matrix z a has the elements z a q. Hence, 

w a = J a lz a + J a ls ^ w a^ -^as^s + J a A as^s 
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where , J a is the inertia of the vehicle plus locked wheels minus the inertias 
of the wheels about their spin axes; namely. 



If there is no external torque, the total angular momentum of the system is 
conserved, in which case the dynamic equation is the following: 

(C5) 


v a = Ja z a + J a s ( w a) A as h s , h s = constant 


The following inequality is useful for estimating the effectiveness of the 
nonlinear part of (C5) 


U I kill] < to IkJ Iks II ( 06 ) 

L dt Jz a ■ o 

where 

= IJl 1 - Ja 1 ! + Ua 1 - Ja 1 ! + l-C " 1 (C7) 

and jj_ are the principal inertias of J a . 

The inequality (c6) follows from the Schwartz inequality and the fact that 
ll w a J a ls ( w a)ll < ^ ||w a || 2 and ||A as h s || = ||h s ||. 


27 



APPENDIX D 


FORM OF THE TIME OPTIMAL CONTROL LAW 


Given the following system of differential equations, 

R = S(w a )R 

with the controlling variable z a restricted to the closed sphere 
The time optimal control maximizes the Hamiltonian (see ref. 8) 

H = tr(p t R) + y a v a 

where the matrix P satisfies the differential equation 

P = S(w a )P 

and the column matrix y a satisfies the differential equation 

y a = 


V 


(Dl) 

(D2) 


^a 1 1 


< z 


Equations (Dl) and (d 4) have the same transition matrix. Hence, 

P = RR t (0)P(0) 


max 


(D3) 


(D4) 


(D5) 


Consequently, 

P t R = P t (0)R(0)R t S(w a )R = P t (0)R(0)S(R t w a ) 


Hence, on taking the trace of the above equation and using equation (A8), one 
obtains the following result. 

/ "t.. 

tr(P R) = - w a Rk 


where k = 2q[P i: (0)R(0) ] . 

Therefore, the time optimal control has the following form 

z a = z maxy a Ayaya) l/2 

where the column y a is the solution of the following boundary value problem. 
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R = S(w a )R 

= az maxya/(yly a ) l/2 

y 8 . = “ Rk 


[R(0), w a (0 ) ] , and (l,0) define the boundary values. 
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